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(54) DISPLAY METHOD FOR TEXT RETRIEVAL RESULT AND DEVICE THEREFOR 

(57)Abstract: 

PROBLEM TO BE SOLVED: To improve both retrieval efficiency and 
accuracy by dividing a text set into plural groups based on the theme 
analysis result of every text, generates the theme sort information 
showing the attribute of every group, and displays these information in 
every group. 

SOLUTION: A retrieval engine 3 expands a retrieval expression based on 
a prescribed algorithm and extracts a relative text set 4 out of a 
document data base 2. A fuzzy gathering part 5a of a processing part 5 
divides the set 4 into plural groups based on the theme analysis result of 
every text, and a theme sort information generation part 5b generates 
the theme sort information showing the attribute of every group. A 
retrieval result display part 6 processes the acquired information (text 
set B, centroid D and theme sort information C) in a prescribed display 
mode and shows them to a user 7. As a result, the document retrieval 
result can be easily confirmed and the retrieval efficiency and accuracy 
can be improved owing to prevention of the retrieval omission. 
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CLAIMS 



[Claim(s)] 

[Claim 1] The division step which divides automatically the text set obtained by searching a database 
based on the given retrieval conditions into two or more groups using the theme analysis result of each 
text, The generation step which generates the theme classification information expressing the attribute 
of groups involved about each of each group which was obtained by said division step, The text browsing 
result method of presentation characterized by what the display step which classifies and displays the 
theme classification information of each group who asked at said generation step according to a group is 
provided for. 

[Claim 2] The division step which divides automatically the text set obtained by searching a database 
based on the given retrieval conditions into two or more groups using the theme analysis result of each 
text, The generation step which generates the theme classification information expressing the attribute 
of groups involved about each of each group which was obtained by said division step, The group 
goodness of fit calculation step which asks for the goodness of fit between the group and said retrieval 
conditions about each of each of said group, The text browsing result method of presentation 
characterized by what the display step which classifies the theme analysis information of each group 
who asked at said generation step according to a group, and displays it on descending of the goodness 
of fit for which it asked by said goodness of fit calculation step is provided for. 

[Claim 3] The division step which divides automatically the text set obtained by searching a database 
based on the given retrieval conditions into two or more groups using the theme analysis result of each 
text, Based on the analysis result of the contents of each text in said group, whenever [ affiliation / 
which computes whenever / affiliation / to the groups involved of each text ] a calculation step and in 
said two or more groups The text browsing result method of presentation characterized by what the 
selection step for choosing the group who becomes a text display object, and the display step which 
indicates the text in the group chosen at said selection step by contents at the order of whenever [ said 
affiliation / which was computed ] are provided for. 

[Claim 4] The division step which divides automatically the text set obtained by searching a database 
based on the given retrieval conditions into two or more groups using the theme analysis result of each 
text, Based on the analysis result of the contents of each text in said group, the goodness of fit 
calculation step which computes the goodness of fit to said retrieval conditions of each text, and in said 
two or more groups The text browsing result method of presentation characterized by what the 
selection step for choosing the group who becomes a text display object, and the display step which 
indicates the text in the group chosen at said selection step by contents at the order of said computed 
goodness of fit are provided for. 

[Claim 5] The division step which divides automatically the text set obtained by searching a database 
based on the given retrieval conditions into two or more groups using the theme analysis result of each 
text, Based on the analysis result of the contents of each text in said group, whenever [ affiliation / 
which computes whenever / affiliation / to the groups involved of each text ] A calculation step, Based 
on the analysis result of the contents of each text in said group, the goodness of fit calculation step 
which computes the goodness of fit to said retrieval conditions of each text, and in said two or more 



groups The group selection step for a display for choosing the group who becomes a text display object, 
The display-order criteria selection means for choosing whether the text in said each group is displayed 
in order of the goodness of fit to retrieval conditions, or it displays in order of whenever [ to groups 
involved / affiliation ], The text browsing result method of presentation characterized by what the 
display step which indicates the text in the group chosen at said group selection step for a display by 
contents at the order of the display-order criteria chosen with said display-order criteria selection 
means is provided for. 

[Claim 6] Said division step is the text browsing result method of presentation according to claim 1 to 5 
characterized by what the text set obtained by searching a database based on the given retrieval 
conditions is divided into two or more groups for using the fuzzy clustering method. 

[Claim 7] The theme classification information expressing the attribute of the groups involved generated 
at said generation step is the text browsing result method of presentation given in either claim 1 
characterized by what is been what expresses the attribute of groups involved by the group of some 
keywords, or claim 2. 

[Claim 8] The theme classification information expressing the attribute of the groups involved generated 
at said generation step is the text browsing result method of presentation given in either claim 1 
characterized by what is been what expresses the attribute of the section loop formation concerned by 
the short text, or claim 2. 

[Claim 9] Whenever [ affiliation / which performs fuzzy clustering to the description matrix of the text 
set obtained by searching a database based on the given retrieval conditions, and generates whenever / 
to each classification category / affiliation / for every document ] A generation step, Using whenever 
[ said affiliation / which was generated ] the document allotment step which assigns each document to 
1 or two or more classification categories, and in said two or more classification categories The 
classification category selection step for choosing the classification category used as a text display 
object, The text browsing result method of presentation characterized by what the display step which 
indicates the text in the classification category chosen at said classification category selection step by 
contents at the order of a goodness of fit to the group is provided for. 

[Claim 10] Said document allotment step is the text browsing result method of presentation according 
to claim 9 characterized by what each document is assigned for to the classification category of k high 
orders of whenever [ affiliation ]. 

[Claim 1 1] Said document allotment step is the text browsing result method of presentation according 
to claim 9 characterized by what each document is assigned for to the classification category which has 
a value whenever [ beyond a certain threshold alpha / affiliation ]. 

[Claim 12] Said document allotment step is the text browsing result method of presentation according 
to claim 9 characterized by what each document is assigned for to a classification category in 
consideration of the probability distribution of a category. 

[Claim 13] A division means to divide automatically the text set obtained by searching a database based 
on the given retrieval conditions into two or more groups using the theme analysis result of each text, A 
generation means to generate the theme classification information expressing the attribute of groups 
involved about each of each group which was obtained by said division means, The text browsing result 
display characterized by what a display means to classify and display the theme classification 
information of each group who asked with said generation means according to a group is provided for. 
[Claim 14] A division means to divide automatically the text set obtained by searching a database based 
on the given retrieval conditions into two or more groups using the theme analysis result of each text, A 
generation means to generate the theme classification information expressing the attribute of groups 
involved about each of each group which was obtained by said division means, A group goodness of fit 
calculation means to ask for the goodness of fit between the group and said retrieval conditions about 
each of each of said group, The text browsing result display characterized by what a display means to 
classify the theme analysis information of each group who asked with said generation means according 



-3- 



to a group, and to display it on descending of the goodness of fit for which it asked with said goodness 
of fit calculation means is provided for. 

[Claim 15] A division means to divide automatically the text set obtained by searching a database based 
on the given retrieval conditions into two or more groups using the theme analysis result of each text, 
Based on the analysis result of the contents of each text in said group, whenever [ affiliation / which 
computes whenever / affiliation / to the groups involved of each text ] a calculation means and in said 
two or more groups The text browsing result display characterized by what the selection means for 
choosing the group who becomes a text display object, and the display means which indicates the text 
in the group chosen with said selection means by contents at the order of whenever [ said affiliation / 
which was computed ] are provided for. 

[Claim 16] A division means to divide automatically the text set obtained by searching a database based 
on the given retrieval conditions into two or more groups using the theme analysis result of each text, 
Based on the analysis result of the contents of each text in said group, a goodness of fit calculation 
means to compute the goodness of fit to said retrieval conditions of each text, and in said two or more 
groups The text browsing result display characterized by what the selection means for choosing the 
group who becomes a text display object, and the display means which indicates the text in the group 
chosen with said selection means by contents at the order of said computed goodness of fit are 
provided for. 

[Claim 17] A division means to divide automatically the text set obtained by searching a database based 
on the given retrieval conditions into two or more groups using the theme analysis result of each text, 
Based on the analysis result of the contents of each text in said group, whenever [ affiliation / which 
computes whenever / affiliation / to the groups involved of each text ] A calculation means, Based on 
the analysis result of the contents of each text in said group, a goodness of fit calculation means to 
compute the goodness of fit to said retrieval conditions of each text, and in said two or more groups 
The group selection means for a display for choosing the group who becomes a text display object, The 
display-order criteria selection means for choosing whether the text in said each group is displayed in 
order of the goodness of fit to retrieval conditions, or it displays in order of whenever [ to groups 
involved / affiliation ], The text browsing result display characterized by what the display means which 
indicates the text in the group chosen with said group selection means for a display by contents at the 
order of the display-order criteria chosen with said display-order criteria selection means is provided for. 
[Claim 18] Said division means is a text browsing result display according to claim 13 to 17 
characterized by what the text set obtained by searching a database based on the given retrieval 
conditions is divided into two or more groups for using the fuzzy clustering method. 
[Claim 19] The theme classification information expressing the attribute of the groups involved 
generated with said generation means is a text browsing result display given in either claim 13 
characterized by what is been what expresses the attribute of groups involved by the group of some 
keywords, or claim 14. 

[Claim 20] The theme classification information expressing the attribute of the groups involved 
generated with said generation means is a text browsing result display given in either claim 13 
characterized by what is been what expresses the attribute of groups involved by the short text, or 
claim 14. 

[Claim 21] Whenever [ affiliation / which performs fuzzy clustering to the description matrix of the text 
set obtained by searching a database based on the given retrieval conditions and generates whenever / 
to each classification category / affiliation / for every document ] A generation means, Using whenever 
[ said affiliation / which was generated ] a document allotment means to assign each document to 1 or 
two or more classification categories, and in said two or more classification categories The classification 
category selection means for choosing the classification category used as a text display object, The text 
browsing result display characterized by what the display means which indicates the text in the 
classification category chosen with said classification category selection means by contents at the 
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order of a goodness of fit to the group is provided for. 

[Claim 22] Said document allotment means is a text browsing result display according to claim 21 
characterized by what each document is assigned for to the classification category of k high orders of 
whenever [ affiliation ]. 

[Claim 23] Said document allotment means is a text browsing result display according to claim 21 
characterized by what each document is assigned for to the classification category which has a value 
whenever [ beyond a certain threshold alpha / affiliation ]. 

[Claim 24] Said document allotment means is a text browsing result display according to claim 21 
characterized by what each document is assigned for to a classification category in consideration of the 
probability distribution of a category. 
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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2. **** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the suitable text-browsing result method of 
presentation and the equipment for retrieval of a document database, divides automatically into two or 
more groups the text set obtained by searching a database based on the retrieval conditions given 
especially using the theme analysis result of each text, and relates to the text-browsing result method 
of presentation and the equipment which classify the theme classification information expressing each 
group's attribute acquired by this division according to a group, and displayed it. 
[0002] 

[Description of the Prior Art] As a conventional text browsing result display, what was indicated by 
JP,6-76004,A is known, for example. 

[0003] The database retrieval solution storing section in which this equipment stores a database 
retrieval result, A distance calculation means between retrieval solutions to compute the distance 
between each retrieval solution by seasoning with a user's control input two or more attribute value 
which said database retrieval solution has, A retrieval solution group division means to divide into the 
group of the number which specified the retrieval solution as the user using the distance between 
retrieval solutions, or the number defined beforehand, A group representation retrieval solution 
calculation means to compute the retrieval solution located near an affiliation group's center of gravity, 
It consists of a representation retrieval solution selection means as which a user is made to choose a 
specific retrieval solution out of each group's representation retrieval solution, and a retrieval solution 
display means in a group to display all the retrieval solutions in the group to whom the representation 
retrieval solution belongs. 

[0004] Namely, if it is in equipment conventionally [ this 1 it is what classifies the database (numeric 
value) retrieval solution structured by the non-overlapping technique into the number of classifications 
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which the user specified. It displays without carrying out ranking of all the retrieval solutions in the 
group who was made to choose the group who displays one retrieval solution nearest to a group's 
classified center of gravity at a time as pilot data, and expects it of a user (with no ranking), and was 
chosen. 
[0005] 

[Problem(s) to be Solved by the Invention] However, if shown in such a conventional text browsing 
result display (retrieval solution display), there was a trouble said that application in a destructuring 
database like the full text is difficult for the following reason. 

[0006] That is, when a representation retrieval solution is a representation document in a group since 
the representation retrieval solution of the center-of-gravity location in a group is displayed if it is in 
equipment such conventionally, since what expresses the contents of the representation document 
directly is not displayed but the whole document is displayed, it is hard to grasp the group's contents. 
That is, in order to show each group's classified theme semantics, it is desirable for there to be a 
specific past ** case in contents, to extract the attribute item group [Hike in common ] in a group 
rather, and to show a user only by display one retrieval solution nearest to a mere group's center of 
gravity at a time as pilot data. In addition, if it is in the case of a full-text search system, it is 
meaningless to show all attribute data as they are as pilot data, and a new definition of pilot data which 
can understand the contents of a document easily is desired. 

[0007] Moreover, if it is in equipment conventionally, since a group is not arranged in order of the 
goodness of fit to retrieval conditions, it is hard to choose the group who agreed for the purpose of 
retrieval. In addition, only by referring to the representation solution to, if it is in equipment 
conventionally, since the solution in a group is not located in a line in order of whenever [ to a group / 
affiliation ], even when it is hard to grasp a group's image, it is difficult [ it ] to grasp an image with 
reference to other solutions. That is, by the method displayed without carrying out ranking of all the 
retrieval solutions in the selected group, if the classification number of cases increases, a user's burden 
will benefit specification large to a retrieval result. In order to mitigate such a burden and to raise 
retrieval effectiveness, a ranking function which can promote specification in a retrieval result is desired. 
[0008] Furthermore, since usually has two or more themes, a document has a possibility of producing 
leakage in a retrieval result on the display of a document classification result by the conventional 
technique of classifying one document only into one cluster. Therefore, in case a theme classification is 
performed to a document-retrieval result set, the overlapping technique which allows belonging to the 
cluster from which plurality differs (the theme is expressed) is desired. 

[0009] The place which this invention is made in view of an above-mentioned trouble, and is made into 
that purpose The check over a document-retrieval result is made easy. In improvement in retrieval 
effectiveness, and a list Can aim at improvement in the retrieval precision by prevention of the leakage 
in retrieval, and it also becomes the guide of how moreover, the shown theme information extracts data 
efficiently and puts them. It is in offering the retrieval result method of presentation and equipment 
which enabled it to perform advanced adaptation retrieval (Relevance Feedback) using this added 
response indication. 
[0010] 

[Means for Solving the Problem] Invention of a publication to claim 1 (or claim 13) of this application 
The division step which divides automatically the text set obtained by searching a database based on 
the given retrieval conditions into two or more groups using the theme analysis result of each text (or 
means), The generation step which generates the theme classification information expressing the 
attribute of groups involved about each of each group which was obtained by said division step (or 
means) (or means), It is shown in the text browsing result method of presentation (or equipment) 
characterized by what the display step (or means) which classifies and displays the theme classification 
information of each group who asked at said generation step (or means) according to a group is provided 
for. 
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[001 1] Here, the text set of the homepage which exists on the text set remembered to be a ''database" 
by mass storage media, such as a hard disk and an optical disk, or the Internet is equivalent to this. 
[0012] Moreover, "theme analysis" means generating the information which shows the contents of the 
text directly, and the set of the keyword on the title in a document may be generated. In the gestalt of 
operation, the vector (Fi) which is expressing the document by the feature vector in document space is 
equivalent to this. 

[0013] Moreover, "theme classification information" means the information which shows the group's 
contents directly about the group of a text. Two methods of a keyword method and a text method are 
shown by the gestalt of operation. 

[0014] And since according to invention of this claim 1 (or claim 13) the information which expresses a 
group directly is added and it indicates by the partition according to a group, it becomes easy to grasp 
the overview of the group who constitutes a retrieval result, and the group selection for the next 
processing becomes very easy. 

[0015] Invention of claim 2 (or claim 14) of this application The division step which divides automatically 
the text set obtained by searching a database based on the given retrieval conditions into two or more 
groups using the theme analysis result of each text (or means), The generation step which generates 
the theme classification information expressing the attribute of groups involved about each of each 
group which was obtained by said division step (or means) (or means), The group goodness of fit 
calculation step which asks for the goodness of fit between the group and said retrieval conditions 
about each of each of said group (or means), The theme analysis information of each group who asked 
at said generation step (or means) It is shown in the text browsing result method of presentation (or 
equipment) characterized by what the display step (or means) which classifies according to a group and 
is displayed on descending of the goodness of fit for which it asked by said goodness of fit calculation 
step is provided for. 

[0016] And according to invention of this claim 2 (or claim 14), since it displays on said claim 1 (or claim 
1 3) in order of the goodness of fit to retrieval conditions in addition to the effect of the invention of a 
publication, the group who agreed for the purpose of retrieval can be chosen, checking the group's 
contents. 

[0017] Invention of claim 3 (or claim 15) of this application The division step which divides automatically 
the text set obtained by searching a database based on the given retrieval conditions into two or more 
groups using the theme analysis result of each text (or means), Based on the analysis result of the 
contents of each text in said group, whenever [ affiliation / which computes whenever / affiliation / to 
the groups involved of each text ] a calculation step (or means) and in said two or more groups The 
selection step for choosing the group who becomes a text display object (or means), It is shown in the 
text browsing result method of presentation (or equipment) characterized by what the display step (or 
means) which indicates the text in the group chosen at said selection step (or means) by contents at 
the order of whenever [ said affiliation / which was computed ] is provided for. 
[0018] And since the text in the selected group is displayed in order of whenever [ to a group / 
affiliation ] according to invention of this claim 3 (or claim 1 5), it becomes easy to grasp a definition of a 
group. 

[0019] Invention of claim 4 (or claim 16) of this application The division step which divides automatically 
the text set obtained by searching a database based on the given retrieval conditions into two or more 
groups using the theme analysis result of each text (or means), Based on the analysis result of the 
contents of each text in said group, the goodness of fit calculation step (or means) which computes the 
goodness of fit to said retrieval conditions of each text, and in said two or more groups The selection 
step for choosing the group who becomes a text display object (or means), It is shown in the text 
browsing result method of presentation (or equipment) characterized by what the display step (or 
means) which indicates the text in the group chosen at said selection step (or means) by contents at 
the order of said computed goodness of fit is provided for. 
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[0020] And since according to invention of this claim 4 (or claim 16) the group suitable for retrieval 
conditions is chosen and the text in it is further displayed in order of retrieval conditions, a suitable text 
is early displayed certainly to retrieval conditions rather than the case where a text is displayed in order 
of a goodness of fit without carrying out the group division of the retrieval result. 

[0021] Invention of claim 5 (or claim 17) of this application The division step which divides automatically 
the text set obtained by searching a database based on the given retrieval conditions into two or more 
groups using the theme analysis result of each text (or means), Based on the analysis result of the 
contents of each text in said group, whenever [ affiliation / which computes whenever / affiliation / to 
the groups involved of each text ] A calculation step (or means), Based on the analysis result of the 
contents of each text in said group, the goodness of fit calculation step (or means) which computes the 
goodness of fit to said retrieval conditions of each text, and in said two or more groups The group 
selection step for a display for choosing the group who becomes a text display object (or means), The 
display-order criteria selection step for choosing whether the text in said each group is displayed in 
order of the goodness of fit to retrieval conditions, or it displays in order of whenever [ to groups 
involved / affiliation ] (or means), It is shown in the text browsing result method of presentation (or 
equipment) characterized by what the display step (or means) which indicates the text in the group 
chosen at said group selection step for a display by contents at the order of the display-order criteria 
chosen with said display-order criteria selection means is provided for. 

[0022] And according to invention of this claim 5 (or claim 17), according to a user's purpose, the display 
order of a text is changeable. 

[0023] Invention of a publication to claim 6 (or claim 18) of this application In the text browsing result 
method of presentation (or equipment) given in either claim 1 (or claim 13) thru/or claim 5 (or claim 17) 
said said division step (or means) It is characterized by what the text set obtained by searching a 
database based on the given retrieval conditions is divided into two or more groups for using the fuzzy 
clustering method. 

[0024] And since the fuzzy classification (theme classification) by the contents of the theme is 
automatically performed by the orverlapping method to the document set discovered by a certain 
retrieval type according to invention given in this claim 6 (or claim 18), improvement in the retrieval 
precision by prevention of the leakage in retrieval is expectable. 

[0025] Theme classification information expressing the attribute of the groups involved from which 
invention of a publication is generated by claim 1 (or claim 13) or claim 2 (or claim 14) at said generation 
step (or means) in the text browsing result method of presentation (or equipment) of a publication at 
claim 7 (or claim 19) of this application is characterized by what is been what expresses the attribute of 
groups involved by the group of some keywords. 

[0026] And according to invention given in this claim 7 (or claim 19), the attribute of groups involved can 
be intuitively grasped through the group of some keywords. 

[0027] Theme classification information expressing the attribute of the groups involved from which 
invention of a publication is generated by claim 1 (or claim 13) or claim 2 (or claim 14) at said generation 
step (or means) in the text browsing result method of presentation (or equipment) of a publication at 
claim 8 (or claim 20) of this application is characterized by being what expresses the attribute of groups 
involved by the short text. 

[0028] And according to invention given in this claim 8 (or claim 20), the attribute of groups involved can 
be made to understand intelligibly for anyone through a short text. 

[0029] Invention of a publication to claim 9 (or claim 21) of this application Fuzzy clustering is performed 
to the description matrix of the text set obtained by searching a database based on the given retrieval 
conditions. Whenever [ affiliation / which generates whenever / to each classification category / 
affiliation / for every document ] A generation step (or means), Using whenever [ said affiliation / which 
was generated ] the document allotment step (or means) which assigns each document to 1 or two or 
more classification categories, and in said two or more classification categories The classification 
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category selection step for choosing the classification category used as a text display object (or means), 
It is shown in the text browsing result method of presentation (or equipment) characterized by what the 
display step (or means) which indicates the text in the classification category chosen at said 
classification category selection step (or means) by contents at the order of a goodness of fit to the 
group is provided for. 

[0030] And according to invention given in this claim 9 (or claim 21), each document is assigned to 1 or 
two or more classification categories using the overlapping technique, and since the text in the 
classification category chosen in that condition is indicated by contents at the order of a goodness of fit 
to that group, improvement in the retrieval precision by prevention of the leakage in retrieval can be 
aimed at in improvement in retrieval effectiveness, and a list. 

[0031] Invention given in claim 10 (or claim 22) of this application is characterized by what said 

document allotment step (or means) assigns each document for in the classification category of k high 

orders of whenever [ that affiliation ] in invention given in said claim 9 (or claim 21). 

[0032] And according to invention given in this claim 10 (or claim 22), the high order of whenever 

[ affiliation ] can be made to always display the document of the fixed number on claim 9 (or claim 21) 

about each classification category in addition to the effect of the invention of a publication. 

[0033] Invention given in claim 1 1 (or claim 23) of this application is characterized by what said 

document allotment step assigns each document for to the classification category which has a value 

whenever [ beyond a certain threshold alpha / affiliation ] in invention given in said claim 9 (or claim 21). 

[0034] And according to invention given in this claim 1 1 (or claim 23), the document which has a value 

whenever [ beyond the threshold alpha which is always in claim 9 (or claim 21) about each classification 

category in addition to the effect of the invention of a publication / affiliation ] can be displayed. 

[0035] Invention given in claim 12 (or claim 24) of this application is characterized by what said 

document allotment step assigns each document for in a classification category in consideration of the 

probability distribution of a category in invention given in said claim 9 (or claim 21). 

[0036] And according to invention given in this claim 12 (or claim 24), the document which relates to 

claim 9 (or claim 21) in consideration of the probability distribution of a category about each 

classification category in addition to the effect of the invention of a publication can be displayed. 

[0037] 

[Embodiment of the Invention] Below, the gestalt of suitable operation of this invention approach and 
equipment is explained at a detail, referring to an accompanying drawing. 

[0038] First, the functional block diagram of drawing 1 shows notionally the configuration of the text 
browsing equipment with which this invention approach and equipment were applied. In this drawing, 1 is 
the retrieval question input section (it is described as Query Input) for inputting the feedback retrieval 
question (FeedBack Query) which should be inputted at the time of the original retrieval question 
(Original Query) and retrieval narrowing down which should be inputted at the time of retrieval initiation, 
and, specifically, is constituted as everyone knows by the software for an input processed in control 
units and those signals, such as a mouse and a keyboard. 

[0039] 2 is a text (document) database (it is described as Doc.DB) equivalent to the text set used as 
the candidate for retrieval, and the text set of the homepage which specifically exists on the text set 
memorized by mass storage media, such as a hard disk and an optical disk, or the Internet is equivalent 
to this. 

[0040] 3 is a search engine (it is describe as Search Engine ) locate in the center of a text retrieval 
system , according to a predetermined algorithm , a retrieval type is develop concrete as everyone 
knows by making into retrieval conditions the original retrieval question (Original Query ) and the 
feedback retrieval question (FeedBack Query ) which are input from the above-mentioned retrieval 
question input section 1 f and the software to extract corresponds to this the text set relate from the 
above-mentioned document database 2 . 

[0041] 4 is the related text set (it is described as Relevant Doc. Retrieved) which did in this way and was 
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extracted by the search engine (Search Engine) 3, and this text set 4 is set as the object of the 
processing processing in this invention so that it may mention later. 

[0042] Fuzzy grouping section (it is described as Fuzzy Clustering) 5a equivalent to a division means by 
which 5 is the processing processing section equivalent to the important section of this invention, and 
this processing processing section 5 divides the text set 4 into two or more groups automatically using 
the theme analysis result of each text, In this way, theme classification information generation section 
(it is described as Subject Generation) 5b which generates the theme classification information 
expressing the attribute of groups involved about each of each group which was obtained is constituted 
as a core. 

[0043] An operation of fuzzy grouping section (Fuzzy Clustering) 5a and theme classification information 
generation section (Subject Generation) 5b is notionally shown in drawing 2 . In this drawing, the field 
surrounded as the continuous line shown with a sign 4 expresses the whole text set (Relevant 
Doc. Retrieved) extracted by the search engine (Search Engine) 3. 

[0044] Three fields which similarly were surrounded with the broken line shown with Signs 4a, 4b, and 4c 
express each of three groups which was divided in the fuzzy grouping section (Fuzzy Clustering) 5. 
[0045] The black painting trigonum mark shown with Sign A expresses the original retrieval question 
(Original Query) inputted at the time of retrieval initiation. The round mark containing x shown with Sign 
B expresses each of each configuration text of the text set 4 in which the retrieval extract was carried 
out by the input of the original retrieval question (Original Query) A. 

[0046] Three void trigonum marks shown with Signs calcium, Cb, and Cc express the theme 
classification information (Group Subject) expressing Groups' 4a, 4b, and 4c attribute. In addition, such 
theme classification information calcium, Cb, and Cc is suitable also as a feedback retrieval question 
(FeedBack Query), if it uses for retrieval narrowing-down ********. 

[0047] Three black painting round marks shown with Signs Da, Db, and Dc express Groups' 4a, 4b, and 
4c center of gravity. Similarly, the black painting square mark shown with Sign D expresses the center of 
gravity of the text set 4. 

[0048] Fuzzy grouping section (Fuzzy Clustering) 5a divides the text set 4 into the groups [ two or more 
(this example three pieces) ] 4a, 4b, and 4c by performing well-known fuzzy clustering processing to the 
text set 4 obtained as a result of retrieval so that clearly from drawing 2 . On the other hand, theme 
classification information generation section (Subject Generation) 5b generates the theme classification 
information calcium, Cb, and Cc expressing the attribute of groups involved about each of each groups 
4a, 4b, and 4c which was obtained in this way. The theme classification information calcium, Cb, and Cc 
expressing the attribute of the groups involved acquired by doing in this way differs in each groups' 4a, 
4b, and 4c centers of gravity Da, Db, and Dc, and becomes what expressed each group's attribute 
directly surely so that clearly from drawing. In addition, the contents of processing of such fuzzy 
grouping section (Fuzzy Clustering) 5a and theme classification information generation section (Subject 
Generation) 5b are explained in more detail later. 

[0049] the information (the text set B, a center of gravity D, theme classification information C) which it 
returns to drawing 1 , and 6 is the retrieval result presentation section (it is described as Hit Window) 
which is equivalent to the important section of this invention similarly, and was acquired by the above- 
mentioned progress in this retrieval result presentation section (Hit Window) 6 — predetermined display 
voice — after processing it like, it shows to a user (it is described as User) 7. Those display modes are 
also later explained to a detail. 

[0050] In addition, in drawing 1 , the information flow by equipment is shown in coincidence 
conventionally which was expressed by the information flow and broken line by this invention expressed 
by the continuous line. Namely, in the retrieval result presentation section (Hit Window) 6, if it is in 
equipment conventionally, as shown in the broken-line arrow head 8, when the number of texts which it 
is only showing a user 7 as it is the text set B obtained as a result of retrieval, and is contained in the 
text set B in this case is abundant, as for a user, inconvenience is caused to discovering the target text. 
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If it is in this invention, on the other hand, in the retrieval result presentation section (Hit Window) 6 As 
shown in the continuous-line arrow head 9, only not only in the text set B obtained as a result of 
retrieval Since a user 7 will be shown even the theme classification information (Group Subject) C at the 
center-of-gravity (Clustercentroids) D list of each classification, Especially, it becomes possible by 
making this theme classification information (Group Subject) C into a key to discover the target text 
easily. Namely, as shown in the continuous-line arrow head 10 The acquired theme classification 
information C (it is equivalent to CI, C2, and C3 of drawing 2 ) thus, as it is If it gives the retrieval 
question input section (Query Input) 1 as a feedback retrieval question (FeedBack Query) C (signs that 
a retrieval question branches by the continuous-line arrow head 1 1 of drawing 2 R> 2 "Query Splitting" 
are shown), the text set 4 will be narrowed down exactly. The target text can be discovered easily, 
namely, advanced adaptation retrieval (relevance feedback) can be made to perform. 
[0051] Next, the text browsing equipment explained notionally above is explained to a detail with 
reference to the drawing below drawing 3 focusing on data processing for realizing the screen-display 
mode and it further. 

[0052] The whole data. processing in the text browsing equipment concerning this invention is shown in 
the General flow chart of drawing 3 . In addition, the processing shown in this General flow chart is 
started by choosing one of the menu item of that in a predetermined system menu. 
[0053] If processing is started in this drawing, a retrieval screen will be displayed by the predetermined 
display mode on the screen of the image display machine which constitutes retrieval equipment (step 
301). Thus, an example of the retrieval screen displayed is shown in drawing 4 . As shown in this drawing, 
the display screen is constituted by the longwise rectangle-like window W1, and the part of the up 
abbreviation 1/3 is made into the retrieval question input area A1, and let the part of the lower 
abbreviation 2/3 be the retrieval result output area A2. 

[0054] Into the retrieval question input area A1, the window W2 for a retrieval question input prepares, 
and is carried out. To this window W2 up side the input guide sentence (Enter Query in plain English) 12 
— moreover, in the right-hand side The start button 13 for giving the starting command to the search 
engine (Search Engine) 3 mentioned above (it is described as the inside O.K. of drawing), The 
cancellation carbon button (it is described as the inside CANCEL of drawing) 14 for canceling a retrieval 
question (Query) and the help button (it is described as the inside HELP of drawing) 15 for asking for 
actuation exchange etc. from a system are formed. 

[0055] In the retrieval result output area A2, window W3 for a retrieval result output is prepared, and the 
scroll bar 16 is formed in the right-hand side of this window W3. furthermore, to this retrieval result 
output area A2 down side The whole sentence demand carbon button 17 for requiring a text whole 
sentence output as a retrieval result (it is described as Full Text in drawing), The QBE carbon button 18 
and the classification-ized demand carbon button 19 for requiring classification-ization of a retrieval 
result (it is described as the inside Grouping of drawing), The abstract demand carbon button (it is 
described as the inside Summarize of drawing) 20 for requiring a text abstract output as a retrieval 
result and the reset button (it is described as the inside Back of drawing) 21 for returning a screen to 
the initial output state of a retrieval result are formed. 

[0056] In addition, it cannot be overemphasized that it is performed by click actuation of a mouse etc. 
after actuation of various kinds of above carbon buttons 13, 14, 15, 16, 17, 18, 19, 20, and 21 moves 
cursor to the carbon button of hope. 

[0057] And when a retrieval question is inputted like "I wahtto know Clinton's political condition." with a 
natural language (especially this example English) from a keyboard according to the input guide sentence 
(Enter Query in plain English) 12, this inputted retrieval question 22 will be displayed in a window W2. 
[0058] In this condition, if a start button (it is described as the inside O.K. of drawing) 13 is operated 
Return to drawing 3 , retrieval/display process is performed, and a search engine (Search Engine) 3 is 
started. The text set 4 relevant to a retrieval question is extracted from the document database 2, each 
configuration text of the extracted text set is sorted by this order with a high goodness of fit with the 
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retrieval question 22, and only that title 23 is displayed in window W3 (step 302). Moreover, the 
goodness of fit marks 24a, 24b, and 24c which classify the goodness of fit to the retrieval question of 
the text concerned into three steps ("quantity", "inside", low [ "low" ]), and express it are displayed on 
the head part of the title 23 of each text. Here, goodness of fit mark 24c goodness of fit mark 24b 
goodness of fit mark 24a shown by the round mark of black painting out is indicated to be to a goodness 
of fit "quantity" by the round mark of gray painting out is indicated to be to a goodness of fit "inside" 
by the round mark of void supports the goodness of fit "low", respectively. 

[0059] Henceforth, it returns to drawing 3 and will be in the condition of standing by selection of a text 
processing facility to a system side (step 303). In this condition, actuation of the classificationHzed 
demand carbon button (Grouping) 19 shown in the screen of drawing 4 performs classificationHzed 
processing which is the important section of this invention (step 306). 

[0060] The detail of classificationHzed processing is shown in drawing 5 . If processing is started in this 
drawing, it will be in the condition of standing by assignment of classification group number g, by showing 
a predetermined initial screen format (step 501). In this condition, completion of assignment (this 
example "5") of classification group number g performs in order extract processing (step 502) of the 
document characteristic quantity which is the description part of this invention, fuzzy clustering 
processing (it is described as Fuzzy Clustering) (step 503), and generation processing (step 504) of 
theme classification information. 

[0061] In extract processing (step 502) of document characteristic quantity, generation of document 
abstraction and a document feature vector is performed as follows. A document is expressed by the set 
(vector which uses a word as a component) of eclipse **** with weight, and the set of a document is 
expressed as a matrix which uses a word as a component. Therefore, a document is abstracted like 
several 1 by starting automatically the word (important word) used as the description of each document, 
making the class of word into Dimension m, and using the vectorial representation Fi to which each 
element is proportional to the frequency of occurrence of the word of a document unit. 
[0062] 
[Equation 1] 
Fi=(fil, E2, w , fim) 

F i <n*&WLS<? f/v 

f i i : HIS i vxm i k itt * fi* 



The example of a document vector set is shown in Table 1. The weight (for example, frequency) of the 
important word (Clinton, Singapore, China — ) included in each of the configuration document (F1, F2, F3 
— ) of a document set is shown by this example. 
[0063] 
[Table 1] 





Clinton Singapore China ... 


Fl 


0.8 0.4 0.0 


F2 


0.6 0.7 0.0 


F3 


0.5 0.0 0.7 








311 
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The example which developed the document vector set shown in Table 1 to document space is shown in 
drawing 6 . In this example, each configuration document (F1 , F2, F3 — ) of a document set is developed 
by the document space which sets an axis of coordinates as the above-mentioned important word 
(Clinton, Singapore, China — ). 

[0064] the description matrix of the document set as a retrieval result in the continuing fuzzy clustering 
processing (step 503) — receiving — well-known FCM — two kinds of classification information (Vc, Ui) 
as follows is generated by performing fuzzy clustering using law. 

[0065] 1) the representation document feature vector Vc of each classification — [Equation 2] 
Vc=(vd, vc2, van) 

vg \frmy*>-7g<oftmxmnr 

2) whenever [ to each classification category of each document / affiliation ] — Ui — [Equation 3] 
Ui=(uil, ul2, ~, uig) 

u i c : i <o&m?fl'-7 c ^mmst 



&3 

The example of whenever [ document classification affiliation ] is shown in Table 2. Whenever [ affiliation 
/ of each document ] (UI, U2, U3 — ) is shown to each classification group (G1, G2, G3 — ) of every by 
this example. 
[0066] 
[Table 2] 





Gl 


G2 ... Gfl 


Ul 


0.7 


0.2 ... 


U2 


0.8 


0.1 ... 


U3 


0.3 


0.6 



12 

Generation of classification theme information is performed by two kinds of methods as follows in 
generation processing (step 504) of the continuing classification theme information. 
[0067] 1) The keyword method of keyword ****** is a method which expresses each classification 
group's theme with the combination of some keywords, and can consider two kinds of methods as 
follows for the extract of a keyword in that case. The 1st method extracts k words of an element with 
the high weight in the representation document vector Vc of an applicable classification in order, and 
those words are used for it as information showing the group's theme. The 2nd method elects r 
document vectors as the high order of whenever [ affiliation ] to the document set of an applicable 
classification, extracts k words in the r document vector set sequentially from what has the high number 
of appearance documents, and they are used for it as information showing the group's theme information. 
[0068] 2) By the text method of text ******, extract the text which owns most those keywords by 
character string collating per sentence using the keyword theme information acquired by the keyword 
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method to the text (a title is included) of r head paragraphs of a document elected in order for the 
above-mentioned keyword method to generate theme information, and use the text sentence as the 
group's theme information. 

[0069] Thus, it will be shown to a user in predetermined presentation sequence so that it may mention 
later, theme information, i.e., classification theme information, of each obtained group (an above- 
mentioned keyword group or an above-mentioned title sentence etc.). Here, if the goodness of fit to the 
retrieval type of Ri and a classification group is set to GRc for the goodness of fit to the retrieval 
question of the searched document i, several 4 relation will be materialized among both. 
[0070] 
[Equation 4] 

GRc= f R i/r c 
i 

Here, how to ask for the number rc (c= 1 , — , g;g: the number of classifications) of documents which was 
shown in several 4 and which was elected as the high order of whenever [ affiliation ] to Group c is 
shown in the flow chart of drawing 7 . If processing is started, after initializing rc in this drawing (step 
701) (rc=0), Whenever ! greatest affiliation ] is called for from the line data Ui of whenever [ affiliation / 
Of Document i ] (step 702). The number rc of members of the maximum and the corresponding group c 
is added (step 703). The above processing (step 702,703) adding i every [ 1 / +], it will be repeated until 
the aggregate value serves as i=n (the number of documents) (step 705 YES), and as a result, finally, the 
value of rc will be calculated (step 704). 

[0071] Thus, completion of generation (the decision of presentation sequence is included) of 
classification theme information starts dynamic display processing of the retrieval result using the 
theme classification information searched for (step 505). (step 504) 

[0072] The detail of the dynamic display process of a retrieval result is shown in the flow chart of 
drawing 8 . If processing is started in this drawing, as the retrieval result output area A2 set up on the 
screen of the image display machine which constitutes retrieval equipment is shown in drawing 9 or 
drawing 10 , 2 ****s will be carried out up and down, and, thereby, the window W4 for theme 
classification information displays (Subject Window) and the window W5 for a retrieval result output (Hit 
Window) will appear, and the window W4 for theme classification information displays (Subject Window) - 
- setting — predetermined display voice — presentation of each classification theme information is 
performed more like (step 801). As mentioned above, presentation of each of this classification theme 
information is performed by a keyword method and the text method. 

[0073] An example of the display screen by the keyword method is shown in drawing 9 R> 9. In addition, 
the searched text set is divided into five classification groups in this example. In the window W4 for 
theme classification information displays (Subject Window), as shown in this drawing, as the left brink 
section is met, five group carbon buttons 25-29 corresponding to the classification group number "1" - 
classification group number "5" are arranged at the vertical single tier, and the keyword groups 30-34 
which express the theme of the classification group concerned exactly are arranged on the right-hand 
side of those group carbon buttons 25-29. In this example, on the right-hand side of the group carbon 
button 25 corresponding to the classification group number "1" As a keyword group 30, 
"SINGAPORE;CANE;PUNISH;US" is displayed and on the right-hand side of the group carbon button 26 
corresponding to the classification group number "2" As a keyword group 31, 

"DALAILAMA;MEET;CHINA;TIBET" is displayed and on the right-hand side of the group carbon button 
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27 corresponding to the classification group number "3" As a keyword group 32, 

"MEET;LEADER;GOVERNMENT;OFFICIAL" is displayed and on the right-hand side of the group carbon 
button 28 corresponding to the classification group number "4" as a keyword group 33 
"NIXON;NATION;SINGAPORE;DIRECTIVE" is displayed and on the right-hand side of the group carbon 
button 29 corresponding to the classification group number "5" 
"QUESTION;CHARACTER;PEOPLE;POLlTICS" is displayed as a keyword group 34. 
[0074] Moreover, such theme classification information is arranged sequentially from what has a high 
goodness of fit with a retrieval question (Query) according to the presentation sequence searched for 
previously. That is, the theme symbolized with the classification group number "1" in this example will 
have the highest goodness of fit with a retrieval question, and a goodness of fit with a retrieval question 
will be the lowest for the theme symbolized with the classification group number "5." Therefore, from 
the display sequence in the window W4 for theme classification information displays (Subject Window), a 
user 7 can know easily the classification group nearest to the information which he is looking for, and 
can check each classification group's theme roughly based on the contents of the keyword groups 30- 
34 which moreover express each contents directly. And as the original retrieval question is met, retrieval 
narrowing down can be efficiently performed by starting classification result display processing (step 
802), so that it may explain in detail later. 

[0075] An example of the display screen by the text method is shown in drawing 10 R> 0. In addition, the 
searched text set is divided into five classification groups also in this example. In the window W4 for 
theme classification information displays (Subject Window), as shown in this drawing, as the left brink 
section is met, five group carbon buttons 25-29 corresponding to the classification group number "1" - 
classification group number "5" are arranged at the vertical single tier, and the short text sentences 
35-39 which express the theme of the classification group concerned exactly are arranged on the right- 
hand side of those group carbon buttons 25-29. In this example, on the right-hand side of the group 
carbon button 25 corresponding to the classification group number "1" "Clinton Protest Singapore 
Caning.Mulls Response" is displayed as a text sentence 35. On the right-hand side of the group carbon 
button 26 corresponding to the classification group number "2" As a text sentence 36, "Clinton Meets 
With Dalai Lama" is displayed and on the right-hand side of the group carbon button 27 corresponding 
to the classification group number "3" as a text sentence 37 "IndianLeader Meet Clinton" is displayed 
and on the right-hand side of the group carbon button 28 corresponding to the classification group 
number "4" As a text sentence 38, "Nixon Had LivingWill" is displayed and on the right-hand side of the 
group carbon button 29 corresponding to the classification group number "5" "Clinton News Conferens- 
Text" is displayed as a text sentence 39. 

[0076] Moreover, according to the presentation sequence searched for previously, it is arranged also 
about such theme classification information sequentially from what has a high goodness of fit with a 
retrieval question (Query). That is, a classification group's theme symbolized with the classification 
group number "1" in this example will have the highest goodness of fit with a retrieval question, and a 
goodness of fit with a retrieval question will be the lowest for a classification group's theme symbolized 
with the classification group number "5." Therefore, from the display sequence in the window W4 for 
theme classification information displays (Subject Window), a user 7 can know easily the classification 
group nearest to the information which he is looking for, and can check each classification group's 
theme roughly based on the contents of the text sentences 35-39 which moreover express each 
contents directly. And as the original retrieval question is met, retrieval narrowing down can be 
efficiently performed by starting classification result display processing (step 802), so that it may explain 
in detail later. 

[0077] Next, the processing for the last display of a retrieval result using Ui is explained to a detail 
whenever [ to each classification group of each document obtained by the fuzzy clustering explained 
previously / affiliation ]. In addition, in this example, three kinds of processings are prepared for the last 
display of a classification result, and these processings are started in the screen shown in drawing 9 or 
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drawing 10 by operating any one of the group carbon buttons 25-29 (step 802). 

[0078] it explained previously — as — the description matrix of the document set as a retrieval result in 
this invention — receiving — FCM — fuzzy clustering is performed using law and, thereby, Ui is 
calculated whenever [ to each classification category of each document / affiliation ]. Now, temporarily, 
five documents (001, 002, 003, 004, 005) exist, and it is assumed about each of those documents that it 
is as whenever [ to each of three classification categories (a category 1, a category 2, category 3) / 
affiliation ] being Table 3. 
[0079] 
[Table 3] 





UT-zfV 1 


&?-=f') 2 


3 


0 0 1 


0. 5 0 


0 . 3 0 


0.2 0 


0 0 2 


0. 6 0 


0.10 


0.3 0 


0 0 3 


0. 10 


0.8 0 


0.10 


0 0 4 


0. 2 5 


0. 3 4 


0.4 1 


0 0 5 


0. 3 5 


0. 1 0 


0.5 5 



& 3 

3 kinds of display-processing (1) - (3) of a fuzzy classification result is explained to the origin of the 
above premise. 

[0080] (1) If it is shown in this display processing when assigning to the classification category of k high 
orders of whenever [ affiliation / of each document ], each document (001-005) is assigned to k 
classification categories chosen sequentially from the high thing of whenever [ affiliation ]. If k= 1 
(binary-ized method), about a document (001), in for example, the category 1 which is 0.50 whenever 
[ maximum affiliation ] About a document (002), in the category 1 which is 0.60 whenever [ maximum 
affiliation ] About a document (003), it is assigned [ document / (004) / document / (005) ] to the 
category 3 which is 0.55 whenever [ maximum affiliation ] by the category 3 which is 0.41 whenever 
[ maximum affiliation ] at the category 2 which is 0.80 whenever [ maximum affiliation ], respectively, 
this — a classification category (G1, G2, G3) — if it arranges independently — category G1= (001 
002) ;N1=2 category G2= (003) 2= ;N1 category G3= (004 005) ; — several documents which serve as N 
3= 2 and are contained in the classification group G1 — several documents with which N1 is contained 
in two pieces and the classification group G2 — several documents with which N2 are contained in one 
piece and the classification group 3 — N3 are made into two pieces. And the document it was 
presupposed that it did in this way and was belonged to each category will be displayed in the window 
(HitWindow) W5 for a retrieval result output with assignment of the group number so that it may explain 
to a detail later. 

[0081] An example of the program for realizing the above display processing (1) is shown in drawing 1 1 . 
If processing is started in this drawing, after performing setting processing (step 1 101) of k value, and 
initialization processing (step 1 102) of i f c, and Nc, The sorting application to the degree line data i of 
affiliation of Document i (step 1 103), The processing which extracts the group number of k pieces 
sequentially from a data value whenever [ maximum affiliation ] (step 1 104), And the processing (step 
1 105) which adds the number of members at the same time it registers Document i into k corresponding 
groups If it is repeated until a publication number i is set to n (step 1 106), and a publication number i 
reaches n, the document allotment result for every group will be outputted, and processing will be 
completed (step 1107). 

[0082] (2) If it is shown in this display processing when assigning to the classification category which 
has a value whenever [ beyond a certain threshold alpha / affiliation ], each document (001-005) is 
assigned to the classification category which has a value whenever [ beyond a certain threshold alpha / 
affiliation ]. Here, as alpha, it is possible to be referred to as 1/g (g: the number of classifications), for 
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example. In the example shown in Table 3, since it is set to g= 3 and alpha= 0.33, about a document 

(001) , in the category 1 whose value is 0.33 or more whenever [ affiliation ] If attached to a document 

(002) , by the same reason, for the reason same about a document (003), it is the reason same about a 
document (005), and is assigned to a category 1 by the category 2 in a category 1 and a category 3 at 
the reason same about a document (004) at a category 2 and a category 3. this — a classification 
category (G1, G2, G3), if it arranges independently Category G1= (001, 002, 005) ; [ N 1= 3 ] Category 
G2= (003 004) ; N 2= 2 Category G3= (004 005) several documents which serve as;N 3= 2 and are 
contained in the classification group G1 — several documents with which N1 is contained in three 
pieces and the classification group G2 — several documents with which N2 are contained in two pieces 
and the classification group 3 — N3 are made into two pieces. And the document it was presupposed 
that it did in this way and was belonged to each category will be displayed in the window (HitWindow) W5 
for a retrieval result output with assignment of the group number so that it may explain to a detail later. 
[0083] An example of the program for realizing the above display processing (2) is shown in drawing 12 . 
If processing is started in this drawing, after performing setting processing (step 1201) of alpha value, 
and initialization processing (step 1202) of i, c, and Nc, The processing which extracts the group number 
of uic>alpha to the degree line data i of affiliation of Document i (step 1203), The processing (step 1204) 
which adds the number of members at the same time it registers Document i into each corresponding 
group If it is repeated until a publication number i is set to n (step 1205), and a publication number i 
reaches n, the document allotment result for every group will be outputted, and processing will be 
completed (step 1 206). 

[0084] (3) If it is shown in this display processing when assigning to a classification category in 
consideration of the probability distribution of a category, each document (001-005) is assigned to a 
classification category in consideration of the probability distribution of a category. Here, the probability 
distribution (Pc) of the classification category of a document is searched for according to several 5, and 
the number Nc of documents of Classification c is called for according to several 6. 
[0085] 
[Equation 5] 

P c = r c / v\ 
r c : (1) 0 2fj^ia ^ 



#5 

[Equation 6] 

N c=N (a) ■ P c 

N (a) : (2) o#J^Wt:£g£i£ 

»6 

In the example shown in Table 3, since it is set to P1=0.4, P2=0.2, and P3=0.4 and is set to N(0.33) = 7, 
it is set to N 1= 2.8 (about 3), N 2= 1.4 (about 1), and N 3= 2.8 (about 3). this — a classification category 
(G1, G2, G3) — if it arranges independently Category G1= (001, 002, 005) ;N 1=2 Category G2= (003) ; 
N 2= 1 Category G3= (002, 004, 005) It is set to;N 3= 2. And the document it was presupposed that it 
did in this way and was belonged to each category will be displayed in the window W5 for a retrieval 
result output (Hit Window) with assignment of the group number so that it may explain to a detail later. 
[0086] An example of the program for realizing the above display processing (3) is shown in drawing 13 . 
If processing is started in this drawing, setting processing (step 1301) of alpha value, initialization 
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processing (step 1302) of i, c, and Nc, processing (step 1303) that searches for the probability 
distribution (Pc=rc/n) of the classification category of a document, and processing (step 1304) which 
calculates Nc of the number of documents of Classification c will be performed one by one. Then, 
sorting application [ as opposed to / whenever / affiliation / of Document c / string data uic ] (step 
1305), The processing which extracts the publication number of the member of Nc individual of 
correspondence in order from a value whenever [ maximum affiliation ] (step 1306), And the processing 
(step 1307) which registers the document of Nc individual into the group c of relevance If it is repeated 
until Classification c turns into a several g classification (step 1308 NO), and Classification c reaches a 
several g classification (step 1308 YES), the document allotment result for every group will be outputted, 
and processing will be completed (step 1309). 

[0087] Next, the document assigned to each classification group in three kinds of either of allotment 
processing (1) - (3) which was explained above explains in what kind of mode it is displayed in the 
window W5 for a retrieval result output on the display screen (Hit Window). 

[0088] It sets on the screen shown in drawing 9 , and they are one of group carbon buttons (in this 
example). If assignment actuation of the group carbon button 26 is carried out, the short text sentences 
(head part of the text concerned which contains a title etc. in this example) 40-44 equivalent to the 
document assigned to each classification group in three kinds of either of allotment processing (1) - (3) 
which was mentioned above It will be displayed in the window W5 for a retrieval result output (Hit 
Window) (step 802). 

[0089] Namely, by having specified the classification group number "2" symbolized by the keyword 
group 31 ("DALAILAMA;MEET;CHINA;TIBET") in this example In the window W5 for a retrieval result 
output (Hit Window) Five text sentences 40 ("Clinton Meets With Dalai Lama) relevant to this, The text 
sentence 41 ("Clinton, Gore MeetDalai Lama on Tibetan Right), The text sentence 42 ("China Warns 
Clinton NottoMeet Dalai Lama"), The text sentence 43 ("Clinton May Meet Dalai Lama before China 
Decision") and the text sentence 44 ("Indian Leader Meet Clinton") are displayed. And since the order 
assignment carbon button 51 of a group goodness of fit described as "G" among drawing is operated, 
these text sentences 40-44 are arranged in order of a goodness of fit with the classification group 
symbolized with the specified classification group number "2" concerned, and are displayed. In addition, 
the scroll bar of the windows W4 and W5 where signs 45 and 46 are located in the left-hand side, 
respectively, and 49 are the displays of classification group number. 

[0090] [ in the window W5 for a retrieval result output (Hit Window) ] furthermore, into each head part of 
each text sentences 40-44 Three kinds of goodness of fit marks which express the goodness of fit 
which each text sentences 40-44 have to the classification group concerned to a three-stage (47a, 47b, 
47c), Three kinds of goodness of fit marks (48a, 48b, 48c) which express the goodness of fit which each 
text sentences 40-44 have to the retrieval question 22 concerned to a three-stage are displayed. The 
shape of a basic form of the goodness of fit mark (47a, 47b, 47c) which expresses a goodness of fit with 
the classification group concerned with this example is SNOW BRAND. About goodness of fit mark 47a 
equivalent to a goodness of fit "quantity", the small-circle form part of the core to black painting out 
About goodness of fit mark 47c which is further equivalent to a goodness of fit "low" in the small-circle 
form part of the core at gray painting out about goodness of fit mark 47b equivalent to a goodness of fit 
"inside", the small-circle form part of the core is considered as void. Moreover, the shape of a basic 
form is a round mark, and the goodness of fit mark (48a, 48b, 48c) showing a goodness of fit with the 
retrieval question concerned is taken as void about goodness of fit mark 48c which is further equivalent 
to a goodness of fit "low" at gray painting out about goodness of fit mark 48b which is equivalent to a 
goodness of fit "inside" at black painting out about goodness of fit mark 48a equivalent to a goodness of 
fit "quantity." 

[0091] Therefore, a user 7 can find out the target information exactly according to the contents 40-44 
of a display in this window W5 for a retrieval result output (Hit Window), checking the text set belonging 
to the group of the classification group number "2" in the text set which it is as a result of retrieval 
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sequentially from what has a high goodness of fit with this classification group "2" by considering a 
goodness of fit mark (47a, 47b, 47c) as reliance. In addition, since the goodness of fit of each text 
sentences 40-44 and the retrieval question 22 can also be known by referring to a goodness of fit mark 
(48a, 48b, 48c), much more positive retrieval narrowing down can be performed by considering both 
marks 47 and 48 as reference. In addition, although not illustrated, when the order assignment carbon 
button 50 of a retrieval question goodness of fit describe^ as "R" among drawing is operated, in drawing 
8 , classification subject guide assignment processing (step 804) is performed, and each text sentences 
40-44 will be arranged in order of a goodness of fit with the retrieval question 22 concerned, and will be 
displayed. Therefore, a retrieval result can be checked along the desired retrieval direction by any of the 
order assignment carbon button 50 of a retrieval question goodness of fit, and the order assignment 
carbon button 51 of a group goodness of fit are chosen, changing the array of each text sentences 40- 
44. 

[0092] In the condition that the retrieval result shown in drawing 9 is displayed on the other hand, if the 
actuation exchange demand carbon button (HELP) 15 is operated, it will return to drawing 8 and subject 
guide option processing (step 805) will be performed, and the display in the window W4 for theme 
classification information displays (Subject Window) will change from the above-mentioned keyword 
method to a text method, as shown in drawing 10 . Therefore, by the keyword method, even when it is 
hard to grasp the contents of the classification group concerned, according to presenting of the theme 
classification information by this text method, the theme symbolized into the classification group 
concerned can be known more exactly. In addition, when an indicative data is not settled in each window 
W4 and W5, it cannot be overemphasized that it can check scrolling the contents of a display by 
actuation of scroll bars 45 and 46. 
[0093] 

[Effect of the Invention] By the above explanation, it can become also to the guide of how by the ability 
being able to aim at improvement in the retrieval precision by prevention of the leakage in retrieval in 
improvement in retrieval effectiveness, and a list by make the check over a document retrieval result 
easy according to this invention, the theme information moreover showed extracts data efficiently, and 
puts them so that clearly, and advanced adaptation retrieval ( Relevance Feedback) can be make to 
perform using this added response indication. 



[Translation done.] 



* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2. **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is the block diagram showing notionally the configuration of the text browsing equipment 
with which this invention approach and equipment were applied. 

[Drawing 2] It is the explanatory view showing notionally an operation of the fuzzy grouping section 
(Fuzzy Clustering) and the theme classification information generation section (Subject Generation). 
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[Drawing 3] It is the General flow chart which shows roughly the whole actuation of the text browsing 
equipment concerning this invention. 

[Drawing 4] It is the screen explanatory view showing the condition of having performed retrieval 
actuation without grouping processing in the text browsing equipment concerning this invention. 
[Drawing 5] It is the flow chart shown focusing on generation processing of the theme classification 
information in the text browsing equipment concerning this invention. 

[Drawing 6] It is the explanatory view showing notionally generation of the document abstraction in the 
text browsing equipment concerning this invention, and a document vector. 

[Drawing 7] It is the flow chart which shows the processing for asking for Group's c number rc of 
members in the text browsing equipment concerning this invention. 

[Drawing 8] It is the flow chart which shows dynamic processing of the retrieval result using the theme 
classification information in the text browsing equipment concerning this invention. 
[Drawing 9] It is the screen explanatory view showing the condition of having performed retrieval 
actuation accompanied by grouping processing by the keyword method in the text browsing equipment 
concerning this invention. 

[Drgwing 10] It is the screen explanatory view showing the condition of having performed retrieval 
actuation accompanied by grouping processing by the text method in the text browsing equipment 
concerning this invention. 

[Drawing 1 1] It is the flow chart which sets with the text browsing equipment concerning this invention 

to display a retrieval result according to a group, and shows the allotment processing to the 

classification category of k high orders of whenever [ affiliation / of each document ]. 

[Drawing 1 2] It is the flow chart which shows the allotment processing to the classification category 

which sets with the text browsing equipment concerning this invention to display a retrieval result 

according to a group, and has a value whenever [ beyond alpha value / affiliation ]. 

[Drawing 13] the rate to the classification category which set with the text browsing equipment 

concerning this invention to display a retrieval result according to a group, and took the probability 

distribution of a category into consideration — market-making — it is the flow chart which shows 

processing. 

[Description of Notations] 

1 Retrieval Question Input Section 

2 Document Database 

3 Search Engine 

4 Extracted Related Text Set 
4a, 4b, 4c Classification group 

5 Processing Processing Section 
5a Fuzzy grouping section 

5b Theme classification information generation section 

6 Retrieval Result Presentation Section 

7 User 

12 Input Guide Sentence 

1 3 Start Button 

14 Cancellation Carbon Button 

15 Help Button 

1 6 Scroll Bar 

1 7 Whole Sentence Demand Carbon Button 

18 The QBE Carbon Button 

19 Classification-ized Demand Carbon Button 

20 Abstract Demand Carbon Button 

21 Reset Button 
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22 Retrieval Question 

23 Title of Each Text Which Constitutes Text Set 
24a, 24b, 24c Goodness of fit mark 

25-29 Group carbon button 
30-34 Keyword group 
35-39 Text sentence 
40-44 Text sentence 
45 46 Scroll bar 

49 Display of Classification Group Number 

47a, 47b, 47c Goodness of fit mark for every group 

48a t 48b, 48c Goodness of fit mark to a retrieval question 

49 Display of Classification Group Number 

50 The Order Assignment Carbon Button of Retrieval Question Goodness of Fit 

51 The Order Assignment Carbon Button of Group Goodness of Fit 
A Original retrieval question 

B Each extracted configuration text 

calcium, Cb, Cc Theme classification information 

Da, Db, Dc A group's center of gravity 

A1 Retrieval question input area 

A2 Retrieval result output area 

W2 Window for a retrieval question input 

W3 Window for a retrieval result output 

W4 Window for theme classification information displays 

W5 Window for a retrieval result output 



[Translation done.] 
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[0 0 4 1 ] 4liIOJ:5l:LT^fl>i;> (Search so 



12 

Engine) 3 1 i tl»WSnfcmt5f*7 (R 
elevant Doc. RetrievedtlBT) T$><9, *gyrr-S><fc? 

[0 0 4 2] 5tt**9!©RaK:ffi!ST*lnUa«»T 
*0. Cl©flQlMSSfl5te7-*7 h*£4£&^+7 b 

a-r4»*J#ai;ffla-rS7 7^#l^b» (Fuzzy CI 
usteringtfBT) 5 at. £ 5 L.Tf#'bftfc&£ r ;l'— 7 

■#MIMR*£j*T*^#MMiail«« (Subject Ge 
nerationifB-T) 5 b <t£tN>i LTfltJ5£$ftTV>5. 

[0 0 4 3] 7 7^1^9 (Fuzzy Clustering) 5 
a&^l^iffii±^ (Subject Generation) 5 b 
<Df^fflSr02tC«t^WlC^-r. |1I0^*5^T, ^4T 
*Sn*l^i»KTHSnfcffl«ttlftjSx>^> (Search 

Engine) 3 tCTifll±}3ftfc^*7 1- A£ (Relevant Do 
c. Retrieved) 

[0 0 4 4] HStfcLT. fl^4a, 4b, 4cT^ 

(Fuzzy Clustering) 5 »CT#t!l£ftfc 3 ZXDifll— 7 

[0045] &nA-?*t<ti%mmQ=Mm\*. tt$n 

1&m\ZA1]2nz>*V i>j-)V&m»m (Original Quer 
y) £STo W*»BT*aft*xA0*Lfftt. t'Jyt 
;Mft*KRI (Original Query) A<DXti\z «fc O^tttU 

-r. 

[0 0 4 6] ffCa,' Cb, C c Ttk2*IS 3 <@©Q 
tfcSHfcHJtt, ^-7"4a, 4 b, 4c<Dm&zmm 
T (Group Subject) Sr^T. ft. Cft 
£>©:fegi#Siffl$BC a, Cb, Cc 
fcffl^fttf, 7^ — H/ty^ttXKn (FeedBack Quer 
y) £UT*>»iifc*>©-C»*. 

[0 0 4 7]^Da, Db, D c T7n£ft& 3fi©£l 
SDftEPtt. ^-7"4a, 4b, 4cOl-t**t. 

h*^4©l-L^St. 

[0 0 4 8] B2j&»59ie»a»£J:3fc:. 7 7y-fl^t 
SB (Fuzzy Clustering) 5a!i, ^©i&SffSftfcx 
h**&4K*#LT\ iia07 7y-f 0 5A3"J >7* 
ffiSfcJfiTiilCfctK f+7H$4S«M (-© 
««JTtt3fi) ffl^-7*4a, 4 b, 4 c \Zftm?Z>. 

£BftSflMB£j£ffi (Subject Generation) 5 b 
tt, r^ltft^nfe^^-^a, 4b, 4c©f 

SittffiCa. Cb, Cc^MtS. H^6M6*»ft«t 

Stiii^SIMICa, Cb, Celt ft ^-7*4 
a, 4b, 4c©ii>Da, Db, DctaSWfc© 
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Lfzh(Dtt±Z>. tit. :n^©77V-fl^M (Fuzz 
y Clustering) 5 a fttf£H#Sfll«£j£S& (Subject 
Generation) 5 b ajUWft&lZ-o^XIt, «1C, 

[0049] iim^t, 6ttra*{c*»w©sastc- 

ffl^-rs*SRie*«S« (Hit WindowtfHi") X&O, 
:©MSl*ai (Hit Window) 6T'tt, m'&<Dmm 

(Userite-T) 7 (c*f LTJf *T?>. fn6oasJ8« 

[0050] nitcfe^Tis. n^tc^oaanfc 

** W t J: SWUCDitn tWI K J: 0 * a ftftf*gl 
«3MMt*oTtt, «5RIS*a*ffl (Hit Window) 6 

■ctt, ®*i^ep8 sedans .fc-sc. ^««ismf#e.n 

Iftt^ftS*-*-. CftKttLT. *»9il;:»oXtt. & 
*<S*»jSaB (Hit Window) 6T*(1 ^B9C*S 

#»SfflI't (Clustercentroids) DMtfK^H 
(Group Subject) C*Tt>3i«a.— !f 7 ICJS* 
$n5CiiSSfeJ{), #tc, C®£Il«««r* (Grou 
P Subject) C^fi/i\OilT, IMitSf +Xh 

fl*«f«C (H2©C1, C2, C3tCffi^-TS) 
CD$S7^ — FA*7 Z&mMffl (FeedBack Query) Ci 
UX«3tKHA:&« (Query Input) 1 fC-^An« (0 
2»H^EP1 KcttSRKRI^tt "Query Splitting" 

£ig5K&jU&&5?s (relevance feedback) £fr:b-£3;i 
[0 0 5 1] HliiWCRIIlfcfWHIH 

xmm\zmwtz>. 

[0 0 5 2] *%W\Z%Z>7-*7, U*5R8«K*Stt*x 
-^mm<D±W^m3<D-\£^y)lyu-^^-b\Z7rs 
T. z<Dif^y)iya-^-^-blzin$n^mM 

[0053] mmiz&^Tmmmtthznzt, 

JcD^III^^ns (.X^yy°3 0 1). 



(8) 

»tt«5SRie3A*1H*A 1 taftXfcD, £fcTSBB§3 
[0 0 5 4] «*R|BIA*««A 1 rtKttttSRRHA* 

ffl©f>f >K^7W2**M:»tsnx*o. c»^^>h*^ 

W2CD±W'te. A^tf-f (Enter Query in plain 
English) 1 2*«, WJCbfcttJRx 
>->*> (Search Engine) 3 
io fe*©et!j^^> (B't'OKtEt) 13£. 

(Query) SHOiB-Tfci&oaDifiU^^ > (0tCA 
NCEL<hf2T) 14t, yXfACSLTlMif 
*#i&Sfc»©^;i/:/#*>- (H't'HELPtB-r) 1 

5 t^itenxi^. 

[0 0 5 5] ^«^*Hi^««A 2 |*3fctt«SB*S*lii* 

w3©*itH7?n-;wt- 1 6*^tte>nx^s. 

20 > (El^F u 11 Text <hfBT) 1 7 t, QBE^ 

m$t#5>> (0 + G r o u p i n g ttBT) 19i> t 

■fc#^> ([gtpS umma r i z e tfBT) 2 0i. ■ 

4>B a c k tlBf) 2 1 £1fimt*>1VC^Z>. 
[0 0 5 6] &±0#f«#?>13, 14, 1 
5, 16, 17, 18, 19, 20, 21 <DW>mt. % 

[0 0 5 7] f LT, X-hfi^ bX (Enter Query in p 
lain English) \2\z.'$Lr>X. FA>&ttXKH 
SrgSSfg CftK. d©ffi|Ttt^K) tCX. "I 
wantto know CI inton' s political condition." ©$n< 
CAAtii. d©A*anfc^K^2 2«^^ >H 

[0 0 5 8] ^CD^tCfc^X. (0*OK 
tiBT) 13»Sn5i, 031CM-PX. 
Tfi&m&MTf an, ^^x>v> (Search Engine) 3 

40 ^eaanx, tt**iH»cHa-r*7 i +^h*^4**x 
#^-^^-x2j:ojattjan, drottttsan^x*;*. 

ITiSj ) trE^LXS-rjg^Sv-^2 4 a, 2 4 b, 
2 4 c^snS. luT, Hfe^OjRbOAEpi'X 

^ansis-^sv-^ 2 4 attii^g rffij (c, Kfe^ 
so ojSt©^LEntcx^an-5jg-&s^-^ 2 4 b«ii-&s 
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[0 0 5 9] 03(CM-^T. xT-xAWCiS^T 

3 0 3) . ZLOttJBfcfc^T. g|4©iS®t;:^£ft5# 
iM*#^> (Grouping) 19 ft 5 
t, *56!8©SI«T?*4»Si<fc«l31*^ffSft* Uf 
7^3 0 6) o 
[0 0 6 0] 5 tC^-To HBfc* 

i (ci 0 . #S 7ft g ©»£*1MW -5 tt<£ t ft 

•J&gOmiZ CCWTtt F5 J! ) a«3SS7f*i. 

0 2) » 7 7^-^7^i"J>^j8!I (Fuzzy Cluste 
ringtBT) Ur-^5 0 3) , S^MIIi© 

[0 0 6 1 ] X»#»*©fl&tn$B3 (X^y^5 0 2). 

zmi&mmttz^? h)D tCctoT^^ft, #«©;* 

^«»*«j«BSItr*ff?iJtbT*3ft*. *Ofc 
», &£#©#«££ ft 5¥f§ (fi£8S) ^SliWlc^O 

2>Z\t\Z£?T. x»tt»i©ia<»ctt*ft;Sft*. 

[0 0 6 2] 

[gel] 

Fi=(fil, S2, fim) . . 

F i : Xfti ©#1^<? 
f i j : i£aSj©:fc# i 



& l 
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Clinton Singapore China ... 


Fl 


0.8 0.4 0.0 


F2 


0.6 0.7 0.0 


F3 


0.5 0.0 0.7 








£1 



muzTjiznzxw^? hJvm-s&xmQffiizmmistc 

^£116 tc^f.. £©0<jTte. wi'&<Dmsm (Clinton, 
Singapore. China-) SJffiSttfr*X«SMfcX** 
(Fl, F2, F3-) jWSBBSftTH 

*• 

[0 0 6 4] i<7ry-f • ^7 7>£"J (Xx 
7^5 0 3) Ttt. «3R*g*tLT©X»*^©i|#afT 
JHK*fU a»fflFCMMffl^T77y-f • 
20 iJ^ftfrSCfcKJ:!), *©2il©»ifl (V 
c, Ui) **£j*£ft*. 

[0 0 6 5] 1) ^JSCftSXtftft^^hMc 
[ft 2] 

Vc=(vcl, vc2, van) 

v c j:¥gl©^g^-!/coftSjt* 



» 2 

so 2) #X»©*»a*7-=f«J-^OBfJRaEU i 
[ft3] 

Ui=(uil, ui2, uig) 



(Fl, F2, F3-) 0-tft-e r ftlc$ 
£ft3lig§§ (Clinton, Singapore, China-) <DMfr 

[0 0 6 33 



&3 

40 ©mug (ui, U2, U3-) m&ftmvfr-zf (g 
1, G2, G3-) mzmznx^z. 

[0 0 6 6] 
[«2] 
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17 18 





Gl 


G2 ... G« 


Ul 


0.7 


0.2 ... 


U2 


0.8 


0.1 ... 


U3 


0.3 


0.6 ... 



&2 



[0 0 6 7] 1) 

v c \z#ft2>m?>-<D&^mm<Di¥-m : £:Mm\zkmmtiii> 

[0 0 6 8] 2) b^iS 

x+x-h (^f h;ufc-£tr) tc**u *-7-H**t 
#e.nfc^-7- h*^si«$e^^ijffl Ltx*fiTX?ji 

[0 0 6 9] ^<Dcfc5CbT#e>nfc&^;u-y©^ 

[0 0 7 0] 
[&4] 



rc 



GR c = 2R i/rc 



r c 



R i 



ItH^tLfz r c 



& 4 

w^jefcamsnfcXfHSr c (c = i, -, g;g : 

«*fe7j^0 7©7n-^^-h{c*-r. Hi 
tC&l^T, jaa*«Hi&Sn*t. rcCWt (rc = 
0) Sfr^fcO'S Uf7^7 0 1) . i ©^IIK 

©tfx-^u i \zm,xm^:(Dmms.*^isb >bn Uf 

7 7'7 0 2) . tC^tt^l/T^i^-^c® 
*>A*& r c*ij03¥Sn UfyT'7 0 3) , £X±(DM 
1 (Xr77'7 0 2, 7 0 3) 

■3 Ut7 7"7 0 4) . -t©Jn*tta«I=n (£•*) 
30 tftSiT Ur77 f 7 0 5YES) SiOig^nT, 

[0071] ^©cfcpicux, ftmumtimoQirfL (« 

[0 0 7 2] ^*^*<D»)W*^SO»aB*lI8©7 

fc«5Big£Hia»ttA 2 09 xaa i o ic^sns 
40 xo\z. ±y\z2»man. z.n\z^o±m»m^mm 

^ffl-7^>F? (Subject Window) W4t*3Rie»tB* 
m^-OFO (Hit Window) WSt^ms. fit, 

2 (Subject Window) W4 

[0 0 7 3] +-7-H*iCCJ:4a*BffiO— 
9 (3k?. ft. JKDWTtt. M$nfcf+^ 
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=fcotc, 3iMftm®®iik*mV 4 >FO (Subject Wind 
ow) W4fti:ii ^<D£.m%l\zmo i^lcbT, Km? 

5<iCD^;i/-7 P #^>2 5~2 9^±T— ?"Jl;:gEB£*i 

t&o, ^-ne>©i/;p— y#^>2 5~2 9®^<B9»r 

3 o~3 4**E*ij$nT^s. cuowm. a-siou- 

r i j \zftmTZ>t?)i-zf-fc$>2 5<D&M\Z 
tt, *-7-H»3 0 tbT, "SINGAPORE; CANE; PUN IS 
H;US" ^S^SttT&D. *HBi^-:7#*t C 2 j tC*f 
f&1-^>i7';l/-y^^>2 6©£ffliJ(Cte, +-7-KS3 
ltLT, "DALA I LAMA ; MEET ;CHINA;TIBET" «SStl 
T&D. ^«^;U--/S^ IT 3 J ^MJCTS^-^ 
^>2 7©t«fclt *-7-H»3 2tlT, "MEE 
T; LEADER; GOVERNMENT; OFFICIAL" il«*SSnT6D, # 

f4j \zttm-fz>>?)V--JX$>2 8© 

ftM\Z\t, +-7-H»3 3i:LT, "NIXON ; NATION ; S 
INGAPORE; DIRECTIVE" *»ai*3ftT*0, ^ 

+-7-HI3 4 tVX. "QUEST I ON; CHARACTER ;PE0PL 
E;POLITICS" ifi&mStlT^Z. 

[0074] ^ne©±a»siw*«.' 

&ftfc»jSJ6#fcfi6^, (Query) toMSKO 

«. frgi^i'-TW r i j fcT**sn*±B***t) 
«««« »a^^-^ tsi \z 

xmwiznz^mtimh&mmfflttDm-smfim^z. t 

>K7 (Subject Window) W4|^0gfI#A^, g# 

-HP 3 0~3 4©rt^(rS^T, S-^S^-y©^ 
m*±&fr\zmV&-tZ>Z.ttf-X:%Z>. fLT, ii;BL 

[0 0 7 5] x^T. h73^tC<t^a^ili®©-0*JS:^ 1 

smnftmtffr-zfizftmznT^z. HHc^sn* 

ct^fc. £H#S«««*JB'7-r >K£ (Subject Wind 
ow) W4^tC«. ^<D&mmz%i?£ O \Z LT, 

r i j rsj c*ftf;-t<s 

5®©£MU-7 P #^>2 5~2 9*»±T— j(icE«sn 
T&O. -?-ne>©^;i/-y^^>2 5~2 9©^{giJtC 

£3 5~3 9tfmm2tlT^2>. d©0ijT«< fl-fgdOl/ 

r i j \zttfotz>?)i-zf#$>>2 b<D^m\z 

\t. fWh5:3 5iLT, "Clinton Protest Singap 
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20 

ore Caning. Mulls Response " A*S^£*lT:fe # 
mif)V-y°mn r 2 j ic*ueTS^-^^>2 6© 
*{8'JIC«« x+7> hX3 6 t LT> "Clinton Meets Wi 
th Dalai Lama" ^gnt*»). 
IT 3 j fc»«T*^;W-^5'>2 7©*i«lctt. 7"* 
7% b;£3 7 tLT, "IndianLeader Meet Clinton" 

;i/-7 P ^^>2 8©*(BiJtC«. f^hX38tLT, 
"Nixon Had LivingWill" ^*SSnt*t3. ftM? 
10 rsji tc^j£-r-5f )V—7#9>2 9©*ffl 

tcte, f+Xhi3 9 t LT- "Clinton News Confere 
ns-Text" ^SStlTl/>5. 
[0 0 7 6] cn&OiH^StiMBK^Tt), 

9c\z&tb<bntzm*nm\zm\ mmmm (Query) t© 
ji-&s©ffiv^©^e)ntgE^j$nTu§ 0 

^^;l/-7°#^ f5j KT*iSn5»«^-7'0± 
B#S t>«HRKn t ©ig£&7W£^ £ 1 1 3 . o 
20 T< a.— tf- 7 «±B#§Hf$6*7*ffl 7 -f> K "7 (Subj 
ect Window) W4|*3©S*ISd^, g#©Jf 

b*» ! b-tn-€ f norts&iBttira-r7-*^ hX3 5-3 

fc. fl-StMSf^MS (7f77"8 0 2) ^^KjT^.^ 
[0 0 7 7] Jfclc, fttCl&HJiU;t:7 7>M' • ^^X^'J 

ui$ffliifc> &m%£%:(Dm%s^<Drzi£)(Dmw\z~D^ 
Twm\zmw?z>o «, ncai-m. g-«tt&<z>A*i£ 
*©fc«&icte3as©fflsasffl&£nT&»3, ;:ne>© 
jaatt0 9x«0 1 0 ir^sn-5iBffiic*3^T. 

7'^5'>2 5~2 9 0^1*n*»— 3*«kf^r*^i:I:J: 
Oigft^nS (7>^-y7°8 0 2) „ 
[0 0 7 8] SfclCiaidLfcctStC, 

7 7-/-( • 7 773"J >d/*ff -^ntCtO^^ff© 
40 ft^«*x^'J-^03fJ»«U i *«*»6nTti*. ^> 
<K(C, 5(BOXi (0 0 1, 002, 003, 004, 

005) a<#ftu ^^©^©-s-n-entc-p^xs 
3) ©-tn-fntcwf *0fRK**3S3©aoT?**t« 

[0 0 7 9] 
[S3] 
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21 22 





i 


* 2 


iJf=fV 3 


0 0 1 


0. 5 0 


0. 3 0 


0.2 0 


0 0 2 


0.6 0 


0.10 


0.3 0 


0 0 3 


0.10 


0.8 0 


0.10 


0 0 4 


0.2 5 


0. 3 4 


0.4 1 


0 0 5 


0.3 5 


0.10 


0.5 5 



S3 



a (i) ~ (3) ssigirrs. 
[0080] (i) &xw<Dmmm<D±itikm<Dftm. 

CCD^^aat&oTte, (0 0 1 ~0 0 5) « 

R/f n s © « ^ & © mi t a «n k (@ » # mti r- a' u 

j£) . X» (0 0 1) tOI^Tte«*mJSS0. 5 0T' 
SSAt^'U 1 t, X» (0 0 2) (CC3^Ttt«^f5f)S 
S.O. 6 OTS-S^^rf'J 1 t, (0 0 3) tt3H 

T«S*E>rJS£0. 8 0T*5Af 2 (0 
0 4) tCO^Tteft^ffiJggO. 4 1 T$>-5*^rf'J 3 

t. (005) iz^^xitmxmm&o. 5 5T& 
sato'u 3 tc^n^nwo fattens, cn^^s* 
x3*u (gi, G2, G3) mzmwtzt. 



10 * 



A^n'U G 1 = (001, 002) 
*f3*'JG2= (0 0 3) 

J'JG3= (0 0 4, 0 0 5) 



N 1 = 2 
N 2= 1 
N 3 = 2 

: SCN 1 «2 



ffl. ^^)U-^G2fC-^^n?>X#»N2«Hl, # 

L-T, dWj;5tUT##T:3*U tM^&Cit^tt*: 

^t^tct^-r^j^t, 
^i^d^^mtil^ffl^^OF^ (HitWindow) W5 
ft tgj^ $ ^ 1 1 -5 „ 
[0 0 8 1 ] £(±©^11 (1) £31911- 7* 

#7^UG 1 = (0 0 1, 0 0 2, 
Af J'JG2= (0 0 3, 0 0 4) 
AfJUG3= (0 0 4, 0 0 5) 

t&o. ftm?)i--JG i t-££ns:sc#»N i (J3 

fl. 5r«^)U--/G2t'^^n-5X#|gcN2«2®. # 40 

«y;u-^3tc^^n^>s:»©:N3«2fii$n^o ■* 

££#fc«5itil6*tti:&ffl£-r >F£ (HitWindow) W5 
[0 0 8 3] «±0*w«ia (2) fe^ST**:*©:/ 

p^^AcD-^ji&Hi 2tc^-r 0 rasicfe^Tjaa^H 

te$m>t, attCD|g^®a Uf7/1 2 0 1) RZt 
i, c, N c ©iDJWflsSaa Ur'^1 2 0 2) SrSfT 
tfcft, X» i ©HTrmSfTX-^ i t»T4 u i c > a so 



&$n&£, kffi©!£5£MS Ufy^l 10 1) 
i, c, N c ©WJWMBS Uf"^110 2) SrHfr 
LfcfS, ;£# i (Dffimmft^-? i V- hflflS 

(XzryZfl 10 3). «*mJl«X-*<Bfr£JlIt k 

4) , Rztmm-rz>km<Dtf)\<-7\zx9 1 £2®-rs 
t.mm\z*>n%L*iim-?z>®M Uf^nos) 

106) . XtW 1 A«nlcat5i#^-y*©X 

»wo#ttig**m*bTfflaA**!f7 Uf^i i o 

7) T3. 

[0 0 8 2] (2) **PBffio£Jl±OBf«ftffi*Wr 

^©a^t&gt&oTte, (0 0 1~0 0 5) « 

»*H«oa±O0fJR«ffl**"r45i-IS*7 L =f u two 
Wt^W. CiT*. aiLTIl #J;U£l/g (g : 

tnctmen*. «3(c^$n-5^jx- 

«. g = 3> o=0. 3 3tWfc», (0 0 1) 

trn>T«ffiJSgfil#0. 3 3£t±T»**x^'J 1 
t, 5C« (0 0 2) tOV>Ttel^«ft3lilT*5^U 1 
t, X* (0 0 3) (CO^Tteli^&Stfi-C^a'U 2 

t, (004) \z~D\,*xtem<mttm&-vftT3v 2 
t*T-=fu 3t, x# (0 0 5) tt3v»Tttra«&aa 

TAtt^U 1 t^Jx^'J 3 tfiJDtttt^ns. dn^r^r 
Sar-rf'J (Gl, G2, G3) gy t bests 
0 0 5) ; N 1 =3 
N2 = 2 
N3 = 2 

<Dif)i-zfm^^mtii-r^mm (xTy?i 203), 
»ar*ft>;u-7 p tx» i &ae-r-stisiB#t^>A* 
i^ns-rs^s Uf7yi 204) i 

atntfrSST&DjISn Uf»;^l 2 0 5) , Xf 

*m*UTffla*««7 Uf^l 2 0 6) T*. 
[0 0 8 4] (3) *x^U©«*»*S#ltbT» 

Iffl«ilIl:*oTH (0 0 1-0 0 5) tt 

^n^o :iT, XtO»«*f JU-©**»* (P 
c) tt»5t«eoT*»6n, *fc»JSc©X»»Nc 
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23 24 

[0 0 8 5] 
[»53 

P c = r c /x\ 

r c : (1) <D2«fcfr5fcfcJt »? 



«5 



Nc = N (a) • P c 
N (a) I (2) OfJWt^K«fcj) . 



ft 6 

S3 (C^nSt&JTte, P 1 = 0 . 4, P 2 = 0. 2. 

P3 = 0. 4tft0. £fcN (0. 3 3) = 7 tttZfz 
Nl = 2. 8 (ft 3) , N2=l. 4 (ft 1 ) , N* 

t>7-zi*) G 1 = (001, 
*T-rf'J G 2= (0 0 3) 
*f3 , UG3= (0 0 2, 

ift*. f-bT, ilW.fc^KL/Tg-rtx^'JtCjg-f 5£ 

^#*0«)Et*K««tt*m*ffll>-f (Hit Wi 

ndow) W 5 rtta^KSft*;: ttfti. 
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